
    Quantifying Selective Reporting and the Proteus Phenomenon for Multiple Datasets with Similar Bias

    Meta-analyses play an important role in synthesizing evidence from diverse studies and datasets that address similar questions. A major obstacle for meta-analyses arises from biases in reporting. In particular, it is speculated that findings which do not achieve formal statistical significance are less likely to be reported than statistically significant findings. Moreover, the patterns of bias can be complex and may also depend on the timing of the research results and their relationship with previously published work. In this paper, we present an approach that is specifically designed to analyze large-scale datasets on published results. Such datasets are currently emerging in diverse research fields, particularly in molecular medicine. We use our approach to investigate a dataset on Alzheimer's disease (AD) that covers 1167 results from case-control studies on 102 genetic markers. We observe that initial studies on a genetic marker tend to be substantially more biased than subsequent replications. The chances for initial, statistically non-significant results to be published are estimated to be about 44% (95% CI, 32% to 63%) relative to statistically significant results, while statistically non-significant replications have almost the same chance of being published as statistically significant replications (84%; 95% CI, 66% to 107%). Early replications tend to be biased against initial findings, an observation previously termed the Proteus phenomenon: the chances for non-significant studies going in the same direction as the initial result are estimated to be lower than the chances for non-significant studies opposing the initial result (73%; 95% CI, 55% to 96%). Such dynamic patterns in bias are difficult to capture with conventional methods, which typically assume that simple publication bias operates. Our approach captures and corrects for complex dynamic patterns of bias, and thereby helps generate conclusions from published results that are more robust against the presence of different coexisting types of selective reporting.
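    To make the kind of selection described above concrete, the sketch below simulates studies on a truly null genetic marker and "publishes" them with significance-dependent probabilities taken from the estimates quoted in the abstract (44% for initial non-significant results, 84% for non-significant replications, relative to significant ones). The selection model and all other parameters are illustrative assumptions, not the authors' actual estimation approach.

```python
import numpy as np

rng = np.random.default_rng(0)

# Relative publication chances quoted in the abstract; the selection model
# built around them is an illustrative assumption, not the paper's method.
P_PUB_SIGNIFICANT = 1.00         # significant results: reference category
P_PUB_INITIAL_NONSIG = 0.44      # initial, statistically non-significant result
P_PUB_REPLICATION_NONSIG = 0.84  # statistically non-significant replication

def published_studies(true_log_or=0.0, n_studies=12, se=0.15):
    """Simulate one marker (an initial study plus replications) and return the
    (study_index, estimate) pairs that survive selective reporting."""
    kept = []
    for i in range(n_studies):
        estimate = rng.normal(true_log_or, se)
        significant = abs(estimate / se) > 1.96
        if significant:
            p_pub = P_PUB_SIGNIFICANT
        elif i == 0:
            p_pub = P_PUB_INITIAL_NONSIG
        else:
            p_pub = P_PUB_REPLICATION_NONSIG
        if rng.random() < p_pub:
            kept.append((i, estimate))
    return kept

# For a null marker only ~5% of performed studies are significant by chance,
# but the published record over-represents significant results, and the
# distortion is strongest for initial studies, mirroring the abstract.
pooled = [rec for _ in range(20000) for rec in published_studies()]
for label, estimates in [("initial studies", [e for i, e in pooled if i == 0]),
                         ("replications", [e for i, e in pooled if i > 0])]:
    share = np.mean([abs(e) / 0.15 > 1.96 for e in estimates])
    print(f"{label}: {share:.1%} of published results are significant")
```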

    Measuring co-authorship and networking-adjusted scientific impact

    Appraisal of the scientific impact of researchers, teams and institutions with productivity and citation metrics has major repercussions. Funding and promotion of individuals and survival of teams and institutions depend on publications and citations. In this competitive environment, the number of authors per paper is increasing and apparently some co-authors don't satisfy authorship criteria. Listing of individual contributions is still sporadic and also open to manipulation. Metrics are needed to measure the networking intensity for a single scientist or group of scientists, accounting for patterns of co-authorship. Here, I define I1 for a single scientist as the number of authors who appear in at least I1 papers of the specific scientist. For a group of scientists or institution, In is defined as the number of authors who appear in at least In papers that bear the affiliation of the group or institution. I1 depends on the number of papers authored, Np. The power exponent R of the relationship between I1 and Np categorizes scientists as solitary (R>2.5), nuclear (R=2.25-2.5), networked (R=2-2.25), extensively networked (R=1.75-2) or collaborators (R<1.75). R may be used to adjust the citation impact of a scientist for co-authorship networking. In similarly provides a simple measure of the effective networking size to adjust the citation impact of groups or institutions. Empirical data for the proposed metrics are provided for single scientists and institutions. Cautious adoption of adjustments for co-authorship and networking in scientific appraisals may offer incentives for more accountable co-authorship behaviour in published articles.
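    The I1 definition above is a fixed point over co-author appearance counts, analogous to an h-index computed over co-authors rather than citations. The sketch below is a minimal implementation under that reading; the toy data and the author name are hypothetical, and the R exponent adjustment is not modelled.

```python
from collections import Counter

def i1_index(papers, scientist):
    """I1 read as an h-index over co-author appearance counts: the largest n
    such that at least n distinct co-authors each appear on at least n of the
    scientist's papers."""
    counts = Counter(
        coauthor
        for authors in papers
        if scientist in authors
        for coauthor in authors
        if coauthor != scientist
    )
    freqs = sorted(counts.values(), reverse=True)
    i1 = 0
    for rank, freq in enumerate(freqs, start=1):
        if freq >= rank:
            i1 = rank
        else:
            break
    return i1

# Hypothetical toy data: each paper is the set of its authors' names.
papers = [
    {"A", "B", "C"},
    {"A", "B", "D"},
    {"A", "B", "C", "E"},
    {"A", "C"},
]
print(i1_index(papers, "A"))  # B and C appear 3 times, D and E once -> I1 = 2
```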

    Decision-Making in Research Tasks with Sequential Testing

    Background: In a recent controversial essay, published by JPA Ioannidis in PLoS Medicine, it has been argued that in some research fields, most of the published findings are false. Based on theoretical reasoning it can be shown that small effect sizes, error-prone tests, low priors of the tested hypotheses and biases in the evaluation and publication of research findings increase the fraction of false positives. These findings raise concerns about the reliability of research. However, they are based on a very simple scenario of scientific research, where single tests are used to evaluate independent hypotheses. Methodology/Principal Findings: In this study, we present computer simulations and experimental approaches for analyzing more realistic scenarios. In these scenarios, research tasks are solved sequentially, i.e. subsequent tests can be chosen depending on previous results. We investigate simple sequential testing and scenarios where only a selected subset of results can be published and used for future rounds of test choice. Results from computer simulations indicate that for the tasks analyzed in this study, the fraction of false among the positive findings declines over several rounds of testing if the most informative tests are performed. Our experiments show that human subjects frequently perform the most informative tests, leading to a decline of false positives as expected from the simulations. Conclusions/Significance: For the research tasks studied here, findings tend to become more reliable over time. We also find that performance was surprisingly inefficient in those experimental settings where not all performed tests could be published. Our results may help optimize existing procedures used in the practice of scientific research and provide guidance for the development of novel forms of scholarly communication.
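    The core simulation result, that the share of false positives among positive findings falls over successive rounds of testing, can be reproduced with a much simpler toy model than the research tasks studied in the paper. In the sketch below every hypothesis that tests positive is simply re-tested in the next round; the prior, alpha and power values are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

def false_positive_share(n_hypotheses=100_000, prior=0.05, alpha=0.05,
                         power=0.8, rounds=4):
    """Toy sequential-testing model (not the paper's actual research tasks):
    every hypothesis that tests positive is re-tested in the next round, and we
    track the share of the surviving positives that are actually false."""
    is_true = rng.random(n_hypotheses) < prior
    in_play = np.ones(n_hypotheses, dtype=bool)
    shares = []
    for _ in range(rounds):
        p_positive = np.where(is_true, power, alpha)
        positive = in_play & (rng.random(n_hypotheses) < p_positive)
        shares.append((positive & ~is_true).sum() / max(positive.sum(), 1))
        in_play = positive
    return shares

# With a low prior, most first-round positives are false, but the share drops
# sharply with each round of re-testing.
print([round(s, 3) for s in false_positive_share()])
```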

    Reporting of Human Genome Epidemiology (HuGE) association studies: An empirical assessment

    Background: Several thousand human genome epidemiology association studies are published every year investigating the relationship between common genetic variants and diverse phenotypes. Transparent reporting of study methods and results allows readers to better assess the validity of study findings. Here, we document reporting practices of human genome epidemiology studies. Methods: Articles were randomly selected from a continuously updated database of human genome epidemiology association studies to be representative of genetic epidemiology literature. The main analysis evaluated 315 articles published in 2001–2003. For a comparative update, we evaluated 28 more recent articles published in 2006, focusing on issues that were poorly reported in 2001–2003. Results: During both time periods, most studies comprised relatively small study populations and examined one or more genetic variants within a single gene. Articles were inconsistent in reporting the data needed to assess selection bias and the methods used to minimize misclassification (of the genotype, outcome, and environmental exposure) or to identify population stratification. Statistical power, the use of unrelated study participants, and the use of replicate samples were reported more often in articles published during 2006 when compared with the earlier sample. Conclusion: We conclude that many items needed to assess error and bias in human genome epidemiology association studies are not consistently reported. Although some improvements were seen over time, reporting guidelines and online supplemental material may help enhance the transparency of this literature.

    International ranking systems for universities and institutions: a critical appraisal

    Background: Ranking of universities and institutions has attracted wide attention recently. Several systems have been proposed that attempt to rank academic institutions worldwide. Methods: We review the two most publicly visible ranking systems, the Shanghai Jiao Tong University 'Academic Ranking of World Universities' and the Times Higher Education Supplement 'World University Rankings', and also briefly review other ranking systems that use different criteria. We assess the construct validity for educational and research excellence and the measurement validity of each of the proposed ranking criteria, and try to identify generic challenges in international ranking of universities and institutions. Results: None of the reviewed criteria for international ranking seems to have very good construct validity for both educational and research excellence, and most don't have very good construct validity even for just one of these two aspects of excellence. Measurement error for many items is also considerable or is not possible to determine due to lack of publication of the relevant data and methodology details. The concordance between the 2006 rankings by Shanghai and Times is modest at best, with only 133 universities shared in their top 200 lists. The examination of the existing international ranking systems suggests that generic challenges include adjustment for institutional size, definition of institutions, implications of average measurements of excellence versus measurements of extremes, adjustments for scientific field, time frame of measurement and allocation of credit for excellence. Conclusion: Naïve lists of international institutional rankings that do not address these fundamental challenges with transparent methods are misleading and should be abandoned. We make some suggestions on how focused and standardized evaluations of excellence could be improved and placed in proper context.

    Transparent and accurate reporting increases reliability, utility, and impact of your research: reporting guidelines and the EQUATOR Network

    Although current electronic methods of scientific publishing offer increased opportunities for publishing all research studies and describing them in sufficient detail, health research literature still suffers from many shortcomings. These shortcomings seriously undermine the value and utility of the literature and waste scarce resources invested in the research. In recent years there have been several positive steps aimed at improving this situation, such as a strengthening of journals' policies on research publication and the wide requirement to register clinical trials.

    The influence of the team in conducting a systematic review

    There is an increasing body of research documenting flaws in the methodological and reporting conduct of many published systematic reviews. When good systematic review practice is questioned, attention is rarely turned to the composition of the team that conducted the systematic review. This commentary highlights a number of relevant articles indicating how the composition of the review team could jeopardise the integrity of the systematic review and its conclusions. Key biases, such as sponsorship bias and researcher allegiance, require closer attention, but there may also be less obvious affiliations in teams conducting secondary evidence syntheses. The importance of transparency and disclosure is now firmly on the agenda for clinical trials and primary research, but the meta-biases to which systematic reviews may be exposed now require further scrutiny.

    Effectiveness of strategies to increase the validity of findings from association studies: size vs. replication

    Background: The capacity of multiple comparisons to produce false positive findings in genetic association studies is abundantly clear. To address this issue, the concept of false positive report probability (FPRP) measures "the probability of no true association between a genetic variant and disease given a statistically significant finding". This concept involves the notion of a prior probability of an association between a genetic variant and a disease, making it difficult to achieve acceptable levels for the FPRP when the prior probability is low. Increasing the sample size is of limited efficiency in improving the situation. Methods: To further clarify this problem, the concept of true report probability (TRP) is introduced by analogy to the positive predictive value (PPV) of diagnostic testing. The approach is extended to consider the effects of replication studies. The formula for the TRP after k replication studies is mathematically derived and shown to depend only on prior probability, alpha, power, and the number of replication studies. Results: Case-control association studies are used to illustrate the TRP concept for replication strategies. Based on power considerations, a relationship is derived between the TRP after k replication studies and the sample size of each individual study. That relationship enables study designers to optimize their study plans. Further, it is demonstrated that replication is efficient in increasing the TRP even when the prior probability of an association is low and without requiring very large sample sizes for each individual study. Conclusions: True report probability is a comprehensive and straightforward concept for assessing the validity of positive statistical testing results in association studies. By its extension to replication strategies it can be demonstrated in a transparent manner that replication is highly effective in distinguishing spurious from true associations. Based on the generalized TRP method for replication designs, optimal research strategy and sample size planning become possible.
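    Under the simplest model consistent with the abstract, an initial positive finding plus k positive replications from independent studies that share the same alpha and power, the TRP has a closed form analogous to the PPV of a repeated diagnostic test. The sketch below encodes that assumed form; the paper's exact derivation may differ.

```python
def true_report_probability(prior, alpha=0.05, power=0.8, k=0):
    """TRP after an initial positive finding plus k positive replications,
    assuming independent studies that all share the same alpha and power
    (a simple assumed model; the paper's exact derivation may differ)."""
    numerator = prior * power ** (k + 1)
    return numerator / (numerator + (1 - prior) * alpha ** (k + 1))

# Even with a low prior probability of association, a couple of positive
# replications raise the TRP substantially without requiring huge samples.
for k in range(4):
    print(k, round(true_report_probability(prior=0.01, k=k), 3))
```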

    An automatic method to generate domain-specific investigator networks using PubMed abstracts

    Background: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communication has promoted the development of highly networked research communities. Extending these networks globally requires identifying additional investigators in a given domain, profiling their research interests, and collecting current contact information. We present a novel strategy for building investigator networks dynamically and producing detailed investigator profiles using data available in PubMed abstracts. Results: We developed a novel strategy to obtain detailed investigator information by automatically parsing the affiliation string in PubMed records. We illustrated the results by using a published literature database in human genome epidemiology (HuGE Pub Lit) as a test case. Our parsing strategy extracted country information from 92.1% of the affiliation strings in a random sample of PubMed records and in 97.0% of HuGE records, with accuracies of 94.0% and 91.0%, respectively. Institution information was parsed from 91.3% of the general PubMed records (accuracy 86.8%) and from 94.2% of HuGE PubMed records (accuracy 87.0%). We demonstrated the application of our approach to the dynamic creation of investigator networks by creating a prototype information system containing a large database of PubMed abstracts relevant to human genome epidemiology (HuGE Pub Lit), indexed using PubMed medical subject headings converted to Unified Medical Language System concepts. Our method was able to identify 70–90% of the investigators/collaborators in three different human genetics fields; it also successfully identified 9 of 10 genetics investigators within the PREBIC network, an existing preterm birth research network. Conclusion: We successfully created a web-based prototype capable of creating domain-specific investigator networks based on an application that accurately generates detailed investigator profiles from PubMed abstracts combined with robust standard vocabularies. This approach could be used for other biomedical fields to efficiently establish domain-specific investigator networks.
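    The parsing step described above can be sketched with simple heuristics: take the leading comma-separated token as the institution and match the trailing tokens against a country list after stripping e-mail addresses and postal codes. The function, the tiny country list, and the sample affiliation below are all illustrative assumptions; the paper's actual parsing rules and dictionaries are not reproduced here.

```python
import re

# Hypothetical, heavily simplified affiliation parser; the paper's actual rules
# and its country/institution dictionaries are not reproduced here.
COUNTRIES = {"USA", "United Kingdom", "Germany", "China", "Japan", "Canada"}

def parse_affiliation(affiliation):
    """Heuristically split a PubMed affiliation string into (institution, country)."""
    # Drop a trailing e-mail address, which PubMed often appends to affiliations.
    affiliation = re.sub(r"\s*\S+@\S+\s*$", "", affiliation).rstrip(". ")
    parts = [p.strip() for p in affiliation.split(",") if p.strip()]
    institution = parts[0] if parts else None
    country = None
    for part in reversed(parts):
        # Strip postal codes before matching against the country list.
        cleaned = re.sub(r"\b\d[\w-]*\b", "", part).strip()
        if cleaned in COUNTRIES:
            country = cleaned
            break
    return institution, country

print(parse_affiliation(
    "Office of Public Health Genomics, Centers for Disease Control and Prevention, "
    "Atlanta, GA 30333, USA. first.last@example.gov"
))
```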

    Heterogeneity in Meta-Analyses of Genome-Wide Association Investigations

    BACKGROUND: Meta-analysis is the systematic and quantitative synthesis of effect sizes and the exploration of their diversity across different studies. Meta-analyses are increasingly applied to synthesize data from genome-wide association (GWA) studies and from other teams that try to replicate the genetic variants that emerge from such investigations. Between-study heterogeneity is important to document and may point to interesting leads. METHODOLOGY/PRINCIPAL FINDINGS: To exemplify these issues, we used data from three GWA studies on type 2 diabetes and their replication efforts where meta-analyses of all data using fixed effects methods (not incorporating between-study heterogeneity) have already been published. We considered 11 polymorphisms that at least one of the three teams has suggested as susceptibility loci for type 2 diabetes. The I2 inconsistency metric (measuring the amount of heterogeneity not due to chance) was different from 0 (no detectable heterogeneity) for 6 of the 11 genetic variants; inconsistency was moderate to very large (I2 = 32-77%) for 5 of them. For these 5 polymorphisms, random effects calculations incorporating between-study heterogeneity revealed more conservative p-values for the summary effects compared with the fixed effects calculations. These 5 associations were perused in detail to highlight potential explanations for between-study heterogeneity. These include identification of a marker for a correlated phenotype (e.g. FTO rs8050136 being associated with type 2 diabetes through its effect on obesity); differential linkage disequilibrium across studies of the identified genetic markers with the respective culprit polymorphisms (e.g., possibly the case for CDKAL1 polymorphisms or for rs9300039 and markers in linkage disequilibrium, as shown by additional studies); and potential bias. Results were largely similar when we treated the discovery and replication data from each GWA investigation as separate studies. SIGNIFICANCE: Between-study heterogeneity is useful to document in the synthesis of data from GWA investigations and can offer valuable insights for further clarification of gene-disease associations.
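    For readers unfamiliar with the quantities discussed here, the sketch below computes Cochran's Q, the I2 inconsistency metric, and fixed-effect versus DerSimonian-Laird random-effects summaries from per-study effect estimates and variances. The three log odds ratios in the example are hypothetical, and the cited analyses may differ in estimator details.

```python
import numpy as np

def meta_analysis(effects, variances):
    """Fixed-effect summary, Cochran's Q, the I2 inconsistency metric, and a
    DerSimonian-Laird random-effects summary for per-study effect estimates
    (e.g. log odds ratios) and their variances."""
    effects = np.asarray(effects, dtype=float)
    variances = np.asarray(variances, dtype=float)
    w = 1.0 / variances
    fixed = np.sum(w * effects) / np.sum(w)
    q = np.sum(w * (effects - fixed) ** 2)
    df = len(effects) - 1
    i2 = 100.0 * max(0.0, (q - df) / q) if q > 0 else 0.0
    # DerSimonian-Laird estimate of the between-study variance tau^2.
    tau2 = max(0.0, (q - df) / (np.sum(w) - np.sum(w ** 2) / np.sum(w)))
    w_star = 1.0 / (variances + tau2)
    random_effect = np.sum(w_star * effects) / np.sum(w_star)
    random_se = np.sqrt(1.0 / np.sum(w_star))
    return {"fixed": fixed, "I2_percent": i2,
            "random": random_effect, "random_se": random_se}

# Hypothetical log odds ratios and variances from three studies of one variant;
# the heterogeneous middle study inflates I2 and widens the random-effects CI.
print(meta_analysis(effects=[0.15, 0.35, 0.05], variances=[0.004, 0.006, 0.005]))
```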